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PREFACE 


In  this  Memorandum  the  author  presents  probability 
tables  o£  the  negative  binomial  distribution  for  some 
useful  sets  of  parameter  values.  This  distribution  may 
be  used  as  a  frame  of  reference  for  the  study  of  the 
demand  for  replacement  parts. 
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SU1MARY 


This  Memorandum  presents  cables  giving  the  values  of 
the  individual  terms  of  the  negative  binomial  distribution 
for  130  pairs  of  parameter  values  in  Part  1.  Part  2, 
giving  the  cumulated  terms  of  the  same  distributions,  is 
in  a  form  directly  usable  for  solving  problems  which  re¬ 
quire  the  determination  of  probabilities  of  the  occurrence 
of  not  more  than  x  events. 

The  negative  binomial  !io;:ribution  is  described  and 
illustrative,  examples  are  £wen  which  use  this  distribution 
as  a  tool  n>  the  study  of  demand  for  replacement  parts. 

The  references  indicat  hat  the  negative  binomial 
has  been  used  in  a  wide  variety  of  applications  to  bio¬ 
logical  research. 
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SOMF.  TABLES  OF  THE  NEGATIVE  BINOMIAL  DISTRIBUTION 

AND  THEIR  USE 


1  .  INTRODUCTION 

The  research  worker  in  the  field  of  logistics  is 
frequently  required  to  predict  the  demand  for  spare  parts 
in  order  that  effective  decisions  can  be  made  in  the  areas 
of  procurement,  distribution,  and  stockage  policies.  Many 
such  decisions  are  made  on  the  basis  of  a  specialized 
knowledge  about  an  individual  line  item. 

For  routine  decisionmaking  on  a  large  scale,  it  is 
useful  to  have  a  small  number  cf  known  probability  distri¬ 
butions  which  can  be  used  as  a  framework  to  provide  the 
basic  assumptions  from  which  predictions  of  future  demands 
can  be  made.  If  demand  data  are  available,  an  analyst 
may  study  the  sample  data  and  make  estimates  of  the  param¬ 
eters  of  the  underlying  demand  distribution.  The  informa¬ 
tion  in  the  sample  will  often  be  scanty,  e.g.,  six  months 
experience  at  one  base  or  three  months  of  system-wide 
experience.  A  frame  of  reference  is  needed  to  study  the 
limited  data. 

Some  examples  of  studies  in  which  it  was  necessary  to 
make  an  assumption  about  the  demand  for  spare  parts  in  the 
population  may  be  found  in  the  study  of  procurement  defer¬ 
ral  by  Petersen  [1],  the  design  of  flyaway  kits  by  Karr, 
Geisler,  and  Brown  [2],  and  by  Fort  [3],  the  prediction  of 


demands  by  Goldman  [4],  and  Geisler  and  Brown  [5],  the 
study  of  stockade  policies  by  Ferguson  [6],  Ferguson  and 
Fisher  [7],  and  Petersen  and  Geisler  [8]. 

There  are  many  distribution  functions  which  could  be 
discussed.  Only  two  are  presented  here,  the  Poisson  and 
the  negative  binomial.  These  two  were  chosen  because  they 
are  probability  distributions  of  a  discrete  variable,  and 
demand  for  spare  parts  is  by  nature  integral,  and  also 
because  we  believe,  hopefully,  that  they  may  describe  the 
universe  of  demands  from  which  the  sample  emanates.  These 
distributions  are  attractive  also  because  of  their  easy 
computation. 

2.  THE  POISSON  DISTRIBUTION 

When  we  deal  with  phenomena  involving  events  that 
occur  randomly  in  time  or  particles  that  are  randomly 
distributed  in  space,  the  Poisson  process  is  the  model 
used.  An  experiment  is  performed  and  "events"  are  tallied. 
These  events  can  be  described  by  a  function  x  •  x(t),  which 
gives  the  number,  x,  of  events  observed  during  the  first 
t  units  of  observation  for  all  values  of  t  from  0  through 
T  (where  T  is  the  total  number  of  observations).  The  re¬ 
sults  of  such  an  experiment  may  be  described  by  the  Poisson 
distribution  if  the  events  occur  randomly  in  the  sense  of 
the  following  definition.  If  any  number  of  events  x  are 
observed  in  any  amount  of  time  t,  and  if  the  points  of  the 
occurrence  of  the  x  events  are  independently  and  uniformly 
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distributed  between  0  and  t,  then  the  process  may  be  de¬ 
scribed  as  random.  If  the  probability  of  the  event  is 
small  but  a  large  number  of  independent  cases  are  taken, 
the  number  of  occurrences  is  likely  to  be  distributed  in 
the  Poisscn  series.  This  distribution  may  be  thought  of 
as  an  approximation  to  the  binomial  distribution  when  the 
probability  of  occurrence  of  the  event  is  small,  that  is, 
if  Np  is  large  relative  to  p  and  N  is  large  relative  to 
Np  (where  N  is  the  number  of  trials  and  p  is  the  probabili¬ 
ty  of  the  occurrence  of  the  event  in  a  single  trial). 

Let  us  try  to  visualize  what  the  situation  might  be 
in  regard  to  demand  for  spare  parts  upon  the  Air  Force 
Supply  System.  Suppose  a  mechanic  inspects  an  actuator 
on  each  of  the  18  planes  of  a  squadron,  once  each  month 
for  two  years,  and  requests  a  replacement  part  whenever  he 
judges  that  the  actuator  has  failed.  Assume  that  the  fail¬ 
ure  rate  for  this  part  has  been  found  to  be  25  in  1000  (i.e., 
1  in  40) .  We  may  then  think  of  this  situation  as  being 
represented  by  a  binomial  distribution  (p+q),  where 
N  -  432  (It  Inspections  per  n»nth  for  24  months),  p  -  .025, 
q  ■  .975  (i.e.,  q  •  l-p,  the  probability  of  nonoccurrence). 
But  the  data  available  to  us  is  monthly  data  by  squadron, 
and  we  are  interested  in  determining  a  squadron  demand 
rate  per  month.  The  monthly  demand  data  for  the  actuator 
looks  like  this:  0—2«0-^)—l«4M)-0—l-0--<L-l-0^—2-0—0—l—l-0— 

0— 1-0—: l.  These  numbers  are  ordered  in  time,  the  first 
observation  being  January,  1956  and  the  last  one  December, 
1957. 


This  is  a  "natural'1  £or  the  use  of  the  Poisson  approx¬ 
imation  to  the  binomial,  since  Np  is  large  relative  to  p 
(432  to  1)  and  N  is  large  relative  to  Np  (40  to  1) .  If 
we  assume  that  the  Poisson  law  describes  the  data  and  use 
as  the  parameter  of  the  Poisson  the  mean  demand  per  squadron 
month,  i.e.,  18  x  .025  -  .45,  we  will  find  a  satisfactory 
fit  with  15  months  of  no  demand,  7  months  of  demand  for 
one  part,  and  2  months  of  demand  for  2  parts. 

The  Poisson  (like  the  binomial)  is  a  distribution  of 
a  discrete  variable  arising  from  enumeration  data  using 
integral  values  only.  Its  basic  characteristic  is  the 
uniform  probability  of  the  occurrence  of  the  event.  In 
contrast  to  both  the  binomial  and  the  normal  distribution, 
it  is  defined  by  a  single  parameter,  the  mean.  The  variance 
is  equal  to  the  mean.  The  Poisson  density  is  represented 
by  the  probability  function  P(x)  ■  e“m  mx/xl,  where 
x  *  0,1, 2, 3* • • ,  and  m  -  mean. 

Tables  of  the  Poisson  distribution  have  been  widely 
published;  for  example  see  (9],  [10],  [11]. 

3.  THE  NEGATIVE  MIMICAL  DISTRIBUTION 

But  the  aircraft  demand  data  described  in  Sec.  2  does 
not  present  this  picture  for  many  of  the  parts,  and  the 
Poisson  distribution  often  has  not  been  a  good  fit  for  the 
series  of  observations.  In  many  cases,  the  lack  of  fit 
was  manifested  in  more  months  of  zero  demand  than  that 
described  by  the  Poisson  distribution.  It  was  also  true 


that  we  were  getting  more  variation  than  was  permitted 
under  the  Poisson  assumption.  For  these  reasons  and 
others  that  will  be  discussed  later,  we  used  a  distribution 
known  as  the  negative  binomial  to  lit  the  observed  data. 

It  is  a  two-parameter  distribution  of  a  discrete  variable. 

The  follow?'  ng  example  illustrates  this  distribution. 

The  monthly  demand  data  from  a  squadron  of  aircraft  for  a 
door  assembly  shows  the  following  series  of  demands  over 
a  3 6- -month  per1’  od:  0—1— 0-0— 0-  -0^— 0— 0-4-0—0-.  0—0—0— 1—0— 0—0— 
0—0—0— 3—0—,  O—O—O— O—l-O— 0-O-"0— 2— 0—0 .  The  sum  of  the  de¬ 
mands  is  12.  The  mean  demand  per  month  is  one— third.  The 
assumption  of  a  Poisson  distribution,  using  1/3  as  the 
parameter  value,  does  not  give  a  good  fit  to  the  data.  The 
negative  binomial  distribution,  using  the  calculation  of 
moments  of  the  observed  distribution  to  estimate  the  param¬ 
eters.  gives  a  good  fit  to  the  data.  The  ratio  of  variance 
to  mean  is  about  2.4.  If  we  use.  an  arbitrary  ratio  of 
variance  to  mean  of  3,  the  fit  is  also  good.  The  inter¬ 
pretation  of  “good  fit'  means  that  the  amount  of  discrepance 
between  the  observed  values  and  the  theoretical  values 
based  on  the  assumed  distribution  Is  not  large  enough  to 
indicate  the  presence  of  anything  more  than  the  captlces 
of  random  sampling.  n  other  words,  the  hypothesis  is 
upheld  that  this  data  could  have  been  obtained  from  a 
population  in  which  monthly  demand  waa  described  by  a 
negative  binomial  distribution. 


The  negative  binomial  distribution  is  completely 
defined  by  two  parameters,  the  arithmetic  maan  m  and  a 
positive  exponent  k.  The  distribution  is  written 

If 

(q  —  p)  where  p  ■  ra/k  and  p  +  1  ■  q.  The  general  term 
in  the  expansion  of  this  binomial  gives  the  probability 
P  that  an  observation  x  will  have  values  0,  1,  2,  ***. 
The  general  term  may  be  written 


The  curve  defined  by  the  value  of  P(x)  is  unimodal,  so 
that  in  fitting  the  negative  binomial  to  an  observed 
di  stribution*  any  apparent  bimodal  or  multimodal  tendency 
is  attributed  to  random  sampling.  The  negative  binomial 
is  an  extension  of  the  Poisson  series  in  which  the  popula¬ 
tion  mean  m  is  not  constant  but  varies  continuously  in  a 
distribution  which  is  proportional  to  that  of  Chi-squarr 
(The  distribution  referred  to  Is  called  Pearson  Type  III 
or  Gamma  distribution).  Thus  the  negative  binomial  may  be 
used  to  represent  a  composite  of  several  Poisson  distribu¬ 
tions  in  which  the  number  of  observations  per  unit  time 
in  repeated  counts  cannot  be  assumed  to  have  the  same 
expected  valu  .mean)  in  each  unit  of  measurement.  Student 
[12]  wrote  in  1919  as  follows:  "If  the  presence  of  one 
individual  in  a  division  Increases  the  chance  of  other 
Individuals  falling  into  that  division,  a  negative  binomial 
will  fit  best,  but  if  it  decreases  the  chance,  a  positive 
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binomial Bliss  reported  [13]  on  fitting  the  negative 
binomial,  to  biological  data:  "The  negative  binomial  is 
the  easiest  to  compute  and  the  most  videly  applicable  of 
the  distributions  for  over-dispersion." 

The  negative  binomial  has  been  used  in  a  variety  of 
applications  to  biological  data.  It  was  used  by  Bliss  [13] 
to  fit  a  distribution  function  to  counts  of  red  mites  on 
apple  leaves.  It  was  used  by  Morgan  et  al  [14]  to  describe 
the  distribution  of  bacterial  clumps  over  a  milk  film. 
Student  [15]  used  it  for  a  description  of  the  counting  of 
yeast  cells  with  a  haemocy tome ter .  Stirritt  [16]  et  al. 
fouv.d  that  it  was  adequate  to  describe  the  distribution  of 
corn  borers  in  a  field  experiment.  Greenwood  and  Yule  [17 J 
used  a  negative  binomial  to  describe  the  distribution  of 
accidents  experienced  by  machinists.  (The  theory  was  that 
some  machinists  were  more  acci dept— prone  than  others. 

There  was  arso  the  possibility  that  shop  conditions  differed 
from  week  to  week  in  such  a  way  as  to  cause  accident  risks 
to  vary  during  succesfsive  weeks  of  the  period  covered  by 
the  data.  In  either  case,  the  resulting  distribution 
might  be  expected  to  be  of  the  form  of  a  negative  binomial.) 
In  plant  ecology,  quadrant  counts  which  deviated  from  the 
Poisson  distribution  were  attributed  to  the  occurrence  of 
plants  in  "clumps."  Blackman  [18]  found  that  the  distri¬ 
bution  of  plants  per  quadrant  agreed  very  well  with  a 
negative  binomial  distribution,  with  estimates  of  param¬ 
eters  derived  from  the  sample  data.  Jones,  Mollison,  and 


Quenouille  [19]  used  che  negative  binomial  in  fitting 
counts  of  soil  bacteria.  Sichel  [20]  made  use  of  a 
negative  binomial  in  studying  psychological  data  on  the 
occurrence  of  minor  accidents  in  an  industrial  plant. 

Additional  references  could  be  cited,  but  our  purpose 
is  merely  to  document  the  fact  that  the  negative  binomial 
distribution  has  been  used  for  years  in  widely  divergent 
fields  of  application. 

Here  we  are  interested  in  the  application  of  this 
particular  distribution  to  the  demand  for  aircraft  spare 
parts.  We  shall  first  discuss  the  rationale  which  leads 
us  to  believe  that  it  may  offer  a  reasonable  description 
of  spare-parts  demand. 

Suppose  that  two-years'  data  on  demand  for  spare 
parts  is  available  at  a  base.  For  illustrative  purposes, 
we  will  aay  that  it  is  a  bate  with  four  squadrons,  each 
consisting  of  18  aircraft.  That  is,  the  data  covers  the 
demands  of  72  planes  for  24  months.  If  all  the  aircraft 
wera  identical,  of  tha  same  age,  flew  Identical  missions, 
undarwent  identical  servicing,  were  subject  to  tha  same 
maintenance  practices,  ate.,  the  demand  for  actuator  parts 
might  be  expected  to  be  about  the  same  as  in  tha  illustra¬ 
tive  example  on  page  4.  In  this  case,  tha  monthly  demands 
would  follow  the  Foikson  distribution.  If  the  population 
were  homogeneous,  random  samples  of  demand  data  would  be 
expected  to  exhibit  only  the  sampling  fluctuations  in¬ 
herent  in  any  well-behaved  variable.  But  the  homogeneity 


described  above  is  not  present  in  the  Air  Force  environment 
in  which  the  demands  are  generated.  For  example ,  even 
though  the  planes  fly  equal  numbers  of  hours,  make  the  same 
number  of  landings,  etc.,  some  planes  are  nevertheless 
subject  to  greater  stresses  than  others  because  of  factors 
of  speed,  altitude,  rate  of  climb,  etc.  The  probability 
of  failure  of  a  particular  part  is  no  longer  constant,  but 
has  increased  due  to  the  stress  factor.  The  recorded 
demands  at  the  base  for  this  part  over  the  24-month  period 
might  be  as  follows:  1-0— 7-0-0— S-i—2-0-0— 6-1— ,  0-1-0— 4— 1-1- 
2-0— 9— 2-0— 3.  The  sum  of  the  demands  at  the  base  is  44. 
There  arc  four  squadrons,  so  that  the  demand  rate  per 
squadron  month  is  the  same  as  in  the  Poisson-fitted  example 
on  p.  4. 

For  these  demand  data,  the  Poisson  is  not  a  good  fit. 

2 

If  we  use  the  X  statistic  to  test  the  agreement  of  the 

observed  distribution  with  the  theoretical  Poisson  distri- 

bution,  the  computed  x  value  for  1  d.f.  is  8.3.  The 

2 

probability  of  obtaining  a  x  value  of  this  magnitude  or 
greater  in  drawing  random  samplas  from  a  homogeneous 
population  is  less  than  .01.  Such  large  values  of  chi- 
square  may  signify  no  more  than  the  presence  of  an  un¬ 
usually  divergent  sample,  but  to  the  investigator  who  is 
none  too  sure  of  his  hypothesis,  the  presence  of  repeated 
large  chi-equare  values  indicates  that  the  hypothesis 
should  be  rejected. 
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However,  if  we  use  the  negative  binomial  probability 
distribution  with  the  same  mean  demand  per  month  and  assume 
that  the  ratio  of  variance  to  mean  is  3,  we  will  obtain  a 
good  fit  for  the  sample  data,  (x  is  .17  for  1  d.f., 

P  =  . 70 . )  The  discrepance  between  the  observed  frequency 
distribution  of  demands  per  month  and  the  theoretical 
frequencies  based  on  the  negative  binomial  is  very  small 
indeed.  The  ratio  of  sample  variance  to  sample  mean  in 
this  case  was  about  3.2.  A  word  of  caution  may  be  appro¬ 
priate  here  concerning  inferences  made  about  the  population 
on  the  basis  of  the  chi-square  test  of  goodness  of  fit. 

The  viewpoint  adopted  is  that  the  hypothesis  is  fixed — 
namely,  that  a  negative  binomial  distribution  describes 
the  population  of  monthly  demanus  for  the  part.  The 
probability  evaluated  above  is  that  of  drawing  from  the 
population  a  sample  more  extreme  than  the  one  in  hand.  It 
is  not  a  method  for  evaluating  the  probability  that  the 
hypothesis  is  correct.  Of  course,  it  is  true  that  after 
the  evidence  from  samples  is  accumulatei  the  analyst- 
researcher  mist  make  a  decision  about  the  hypothesis,  and 
his  decision  has  some  presumably  high  probability  of  being 
correct;  but  we  have  not  presented  a  method  of  evaluating 
such  a  probability. 

There  are  many  factors  in  the  Air  Force  environment 
that  contribute  to  heterogeneity  of  demands.  Among  these 
factors  we  might  list  age  of  aircraft,  applicability  of 
parts,  flying-program  elements,  nature  of  mission. 


servicing  schedules,  maintenance  practices,  design  change 
and  modifications,  changes  in  base  personnel,  etc.  A 
crew  mechanic  inspecting  plane  number  1  may  not  have  the 
same:  careful  standards  as  the  mechanic  who  inspects  number 
2.  Some  inspectors  are  prone  to  replace  narts — some  are  not. 
Demands  for  certain  spare  parts  have  also  been  known  to 
exhibit  a  tendency  to  occur  in  clusters — that  is, the  demand 
for  a  unit  of  part  a  stimulates  demand  for  a  unit  of  part  b, 
which  in  turn  creates  a  demand  for  4  units  of  part  c,  etc. 

All  of  these  factors  change  with  time,  and  are  reflected  in 
the  sample  data. 

4.  TABLES  OF  NEGATIVE  BINOMIAL  DISTRIBUTION 

We  have  found  tables  of  the  negative  binomial  distri¬ 
bution  useful  in  our  work  in  logistics.  Since  they  may 
not  be  readily  accessible,  the  complete  probability  distri¬ 
butions  for  a  limited  number  of  arbitrary  paramater  values 
are  presented  in  the  following  pages.  We  have  used  13 
different  values  of  the  mean  (m  ■  kp)  and  10  different 

ratios  of  variance-to . mean  (q) .  This  gives  us  130  sets 

of  parameter  values  for  which  the  complete  probability 
distribution  la  tabulated. 

The  distribution  function  of  the  negative  binomial  is 

P(x)  -  J&gdll 
(fe-1)'.  x* 

where  x  -  0,1,2,3"», 


p,k  >  0,  and  q  *  1  +  p. 


Part  1  gives  the  individual  terms  of  the  distributions 
and  Part  2  gives  the  cumulative  probability  of  x  demands 
co:  less.  The  values  of  the  mean  (kp)  are  .25 (.25)  1.0  and 
1.0(1  0)10.0.  The  values  of  the  ratio  of  variance  to  mean 
(q)  are  1.5(. 5)5.0  and  5. 0(1. 0)7.0.  The  choice  of  these 
values  makes  p  vary  from  1/2  to  6  and  k  from  ^  to  20. 
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